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SEQUENCE LISTING 



1)\GENERAL INFORMATION: 



^i) APPLICANT: Boynton, John E. 

Gillham, Nicholas W. 
Randolph- Anderson, Barbara L. 
Ishige, Fumiharu 
Sato , Ryo 

^LE OF INVENTION: Methods of Conferring PPO- Inhibit ing 
llerbicide Resistance to Plants by Gene Manipulation 

(iii) NUMBER OF SEQUENCES: 24 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE : Birch, Stewart, Kolasch & Birch, LLP 

(B) STREET: P.O. Box 747 

(C) CITY\ Falls Church 

(D) STATE\ Virginia 

(E) COUNTRY: USA ' 

(F) ZIP: 22B40-34& 

(v) COMPUTER READABLE BORM : 

(A) MEDIUM TYPE: EirCpsy disk 

(B) COMPUTER: IfiMT PC dompatible 

(C) OPERATING SYSTEM :/ PC-DOS/MS-DOS 

(D) SOFTWARE: PaMsnt/n Release #1.0, Version 

(vi) CURRENT APPLICATIOnCdATA : 

(A) APPLICATION NUMBER: US new 

(B) FILING DATE: 30ASEP-1996 

(C) CLASSIFICATION: 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Murphy Jr., Gerald M. 

(B) REGISTRATION NUMBER\28 , 977 

(C) REFERENCE /DOCKET NUMBER: 2185-156P 

(ix) TELECOMMUNICATION INFORMATION 

(A) TELEPHONE: 703 -205 -8000 \ 

(B) TELEFAX: 703-205-8050 



) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 
<D) TOPOLOGY: linear" 

(ii) MOLECULE TYPE: peptide 



(iii) HYPOTHETICAL: NO 
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(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Chlamydomonas reinhardtii 

(B) STRAIN: CC-407 

(ix) FEATURE: 

(A) NAME /KEY : Peptide 

(B) LOCATION: 1..47 

(D) OTHER INFORMATION: /product= "porphyric herbicide 
resistance domain" 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO:l: 

Ala Ala Glu Ala Leu Gly, Ser Phe Asp Tyr Pro Pro Val Gly Ala Val 
15 10 15 

Thr Leu Ser Tyr Pro Leu Ser Ala Val Arg Glu Glu Arg Lys Ala Ser 
20 25 30 

Asp Gly Ser Val Pro Gly Phe Gly Gin Leu His Pro Arg Thr Gin 
35 40 45 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 46 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: peptide 
(iii)' HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: ecotype Columbia 

(ix) FEATURE: 

(A) NAME /KEY : Peptide 

(B) LOCATION: 1..46 

(D) OTHER INFORMATION: /product = "porphyric herbicide 
resistance domain" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

Ala Ala Asn Ala Leu Ser Lys Leu Tyr Tyr Pro Pro Val Ala Ala Val 
15 10 15 

Ser He Ser Tyr Pro Lys Glu Ala He Arg Thr Glu Cys Leu He Asp 
20 25 30 
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Gly Glu Leu Lys Gly Phe Gly Gin Leu His Pro Arg Thr Gin 
35 40 45 

{ 2 ) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Zea mays 

(B) STRAIN: B73 inbred 

(ix) FEATURE: 

(A) NAME /KEY : Peptide 

(B) LOCATION: 1..4 6 

(D) OTHER INFORMATION: /product = "porphyric herbicide 
resistance domain" 

"~ (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

Ala Ala Asp Ala Leu Ser Arg Phe Tyr Tyr Pro Pro Val Ala Ala Val 
15 10 15 

Thr Val Ser Tyr Pro Lys Glu Ala lie Arg Lys Glu Cys Leu lie Asp. 
20 25 30 

Gly Glu Leu Gin Gly Phe Gly Gin Leu His Pro Arg Ser Gin 
35 40 45 

(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 141 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Chlamydomonas reinhardtii 

(B) STRAIN: CC-407 
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(ix) FEATURE: 

(A) NAME /KEY : - 

(B) LOCATION: 1..141 

(D) OTHER INFORMATION : /note= "encodes porphyric herbicide 
resistance domain" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 
GCCGCCGAGG CCCTGGGCTC CTTCGACTAC CCGCCGGTGG GCGCCGTGAC GCTGTCGTAC 6 0 

CCGCTGAGCG CCGTGCGGGA GGAGCGCAAG GCCTCGGACG GGTCCGTGCC GGGCTTCGGT 12 0 

CAGCTGCACC CGCGCACGCA G 141 
(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidops is thaliana 

(B) STRAIN: ecotype Columbia 

(ix) FEATURE: 

(A) NAME / KEY : - - 

(B) LOCATION: 1..138 

(D) OTHER INFORMATION: /note= "encodes porphyric herbicide 
resistance domain" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : . 

GCTGCAAATG CACTCTCAAA ACTATATTAC CCACCAGTTG CAGCAGTATC TATCTCGTAC 6 0 

CCGAAAGAAG CAATCCGAAC AGAATGTTTG ATAGATGGTG AACTAAAGGG TTTTGGGCAA 12 0 

TTGCATCCAC GCACGCAA 13 8 
(2) INFORMATION FOR SEQ ID NO: 6: 



SEQUENCE (_HARAcT.bK±S'f ICS : 

(A) LENGTH: 13 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: not relevant 
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(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Zea mays 

(B) STRAIN: B73 inbred 

(ix) FEATURE: 

(A) NAME / KEY : - 

(B) LOCATION: 1..138 

(D) OTHER INFORMATION: /note= "encodes porphyric herbicide 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO.: 6 : 
GCTGCAGATG CTCTATCAAG ATTCTATTAT CCACCGGTTG CTGCTGTAAC TGTTTCGTAT 60 
CCAAAGGAAG CAATTAGAAA AGAATGCTTA ATTGATGGGG AACTCCAGGG CTTTGGCCAG 120 
TTGCATCCAC GTAGTCAA 13 8 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 nucleotides 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide" 

(iii) HYPOTHETICAL: NO 
(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1..36 

(C) OTHER INFORMATION : /NOTE = "oligonucleotide primer for 
Arabidopsis thaliana" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 
CTATATTACC CACCAATGGC AG CAGTATCT ATCTCG 3 6 

(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 8 nucleotides 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: "oligonucleotide" 
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(iii) HYPOTHETICAL: NO 
(ix) FEATURE: 

(A) NAME /KEY : - 

(B) LOCATION: 1..38 

(C) OTHER INFORMATION: /NOTE = "oligonucleotide primer for Zea 

mays " 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 
GATTCTATTA TCCACCGATG GCTGCTGTAA CTGTTTCG 3 8 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 nucleotides 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: "oligonucleotide" 
(iii) HYPOTHETICAL: YES 

(ix) FEATURE: 

(A) NAME /KEY : - 

(B) LOCATION: 1. .26 

(D) OTHER INFORMATION: /note= "oligonucleotide primer 
common to both of A. thaliana and Z. mays porphyric herbicide 
resistance domain of PPO." 

/note= "N residues can be inosine 
(I) in addition to G, A, T or C. K = G or T, Y = C or T, S = C or G , 
W = A or T 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 

KAYTAYCCNC CNATGGSNGC NGTNWS 2 6 

(2) INFORMATION FOR SEQ ID NO: 10 

(i) SEQUENCE CHARACTERISTICS: - - 

(A) LENGTH: 2573 base pairs 

( B ) TYPE : nucleic acid 

(C) STRANDEDNESS -.not relevant 

(D) TOPOLOGY : not relevant 

(ii) MOLECULAR TYPE : DNA (genomic) 
(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM : Chlamydomonas reinhardtli 

(B) STRAIN:RS-3 

(ix) FEATURE: 

(A) NAME/KEY : - 

(B) LOCATION:!. .2573 
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( C) OTHER INFORMATION : /note= " encodes protoporphyrinogen 

oxidase " 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 



CTCGAGAGCG 


TTGGAGGAAA 


TCCGTTTGGC 


ACCTGTTCCG 


GCTTCTTTGT 


GTGCACGGCC 


60 


ACGTCCCCCT 


TTCCTGCTAC 


CCGCTCCCCC 


CCGGCTTTAC 


TGCCCCTTCC 


ACTCCTCGGC 


120 


TCCATCCCGA 


TTCCATCCGC 


TCCTCCTCCC 


CCACCTAGAC 


TGTCTACCGT 


CTACCAGTTT 


180 


CTTGGGCAAT 


CATTAACGTA 


ACCCCGCCTC 


CCTGCGCCTG 


CCCCTCCCTC 


CCTCTCCCCC 


240 


CCGCACAGCC 


CGCCGCCGCC 


GAGG CCCTGG 


GCTCCTTCGA 


CTACCCGCCG 


ATGGGCGCCG 


300 


TGACGCTGTC 


GTACCCGCTG 


AGCGCCGTGC 


GGGAGGAGCG 


CAAGGCCTCG 


GACGGGTCCG 


360 


TGCCGGGCTT 


CGGTCAGCTG 


CACCCGCGCA 


CGCAGGTGGG 


CAAGTGCGCG 


CGTGTTGCGG 


420 


GCGGTGTGTT 


GCGGAGGGGA 


GGGTGGTGGG 


GGTTGGGGGT 


GGGGGTGGGG 


GGGATTGGGG 


480 


CGCTGGGTCG 


TATCCCGCGG 


TTGTATCCTC 


GCGCTCCCCT 


CATCCATTCC 


CCCCTTCAAC 


540 


AACACACACG 


GGCGCACACG 


CACCCTCTTT 


GCGCTTACTT 


TGTCTGGTGC 


TCCTTAACAC 


600 


ACXCTTCGCT 


TCATTTTGGT 


GTCTTCTAAC 


ACACACACTT 


GTCCACACAC 


AGGGCATCAC 


660 


CACTCTGGGC 


ACCATCTACA 


GCTCCAGCCT 


GTTCCCCGGC 


CGCGCGCCCG 


AGGG CCACAT 


720 


GCTGCTGCTC 


AACTACATCG 


GCGGCACCAC 


CAACCGCGGC 


ATCGTCAACC 


AGACCACCGA 


780 


GCAGCTGGTG 


GAGCAGGTGT 


GTGTGTGGGG 


GGGTGGGGGG 


GGGGCAGTGG 


ATTTTTGGGC 


840 


TGAGCCCCCT 


GAGCAAAGCG 


ATCCAGGGGG 


GGCGAAGCCC 


CCCAGGATTG 


CCCCTGTCCG 


900 


TGCGTGCGTG 


TGTGCCTGTG 


TCGACAAAAA 


GTACCGTACT 


GGCACAAACC 


GCGAGTGCCA 


960 


CGTATTATTA 


ATTGCAATTA 


CCTATTGTAG 


AAAAATAGAC 


GGCAGGGAAA 


ACTCGGCCGG 


1020 


AGCGAGAAGC 


GACCTCGTGA 


GTCCATGGAC 


ATCTTGACTT 


TCTTCAGTTC 


GCGAGTATAG 


' 1080 


CTCTCGGCCC 


CTAAATATCT 


TACATCCATG 


TATCAAAACA 


TGTCGACGAC 


AAGCGTCTTG 


1140 


GGGCAAGAAT 


GTCGAAATTG 


TTTGCAACAG 


CCAAACCATG 


CGTCCCCGAG 


CCTTACATGT 


1200 


GTCGCGGCCC 


GGGATCCCGC 


GCCCGAGCCC 


GGCTAGCCCT 


TTGCGGTGCT 


TGAGTGGGAT 


1260 


GTGGGTGAGG 


TGCATTTGGG 


ATATCATGGA 


CCGTGAAGTG 


GCGTGGGTAA 


GGTGGCGTGG 


1320 


CGTGGCGGGG 


ACAGGGCATG 


TCGGTGCCTC 


GGCACAGCGT 


TGGCCTAGTG 


GCCAGTCCCG 


1380 


CTGGATGGGC 


TTGCAAGGGT 


GCTGTTCATG 


TCGCCGGTGC 


CCATCGTCAC 


ATCCCCTTGC 


1440 


GCTACATGGG 


GCTCAGCCCA 


TTTTCCAGCT 


GTACAAAGCT 


GACACCCCTT 


GTTGTGTGGC 


1500 


GTCTTGGACC 


CGTGTTGCTT 


CGGAGCTGGC 


CAGAACCCCC 


TGTGGGCACA 


CACACGCACA 


1560 
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CACACACACA 


CACACACACA 


CACACACACA 


CACACACACA 


CACACACACA 


CACACACACA 




CACACACACA 


CACACACACA 


CACACACACA 


CACACACACA 


CACATTTTCG 


TCCTGCAGCC 




CCGAACCCCG 


CCGCCCGTTC 


CACGTCTTCC 


ACCTGCCGCA 


CCCCCCCCCC 


TGCCGCACGC 


1740 


CTGCTCTCAC 


CGCCTCTCCC 


CCCACCCCAT 


CTCCCTGCAG 


GTGGACAAGG 


AC CTGCGCAA 


1800 


CATGGTCATC 


AAG CCCGACG 


CGCCCAAGCC 


CCGTGTGGTG 


GGCGTGCGCG 


TGTGGCCGCG 


1860 


CGCCATCCCG 


CAGGTGTGAG 


GGCGCAGCAG 


CCGGAGGGAT 


GGGCTAGATC 


CTAGTTTCTC : 


1920 


AAAGAGCTCT 


ACAGCCCTAT 


AACCTCGACC 


TGCGACCTTC 


GACCTGATAA 


CCTGGCTGCC 


1980 


CCCTCCCAAC 


CTAGCCACCT 


CTCCCCGGAT 


TTGGGTTCAC 


TCGGTTGACT 


TGCTTTTGGG 


2040 


TTCTGGAATC 


AACTTCACCT 


GTTGTATACT 


TTGCTGCACT 


TCTCTGTACC 


ACTCTTTGCA 


2100 


TTAGGTTCGG 


TTTAGTTTGG 


GCTGCATGTG 


TAACCCCTCC 


TCCCCGCCCT 


GCCACCTGCA 


2160 








CAAGGCGCGC 


AAGGCGCTGG 


ACGCGGCGGG 


2220 


GCTGCAGGGC 


GTGCACCTGG 


GGGGCAACTA 


CGTCAG CGGT 


GAG CGCGTGG 


GCAG CAGCAG 


2280 


CAGCAGGAAG 


AGGGGAGGGG 


AGGGGAGGGG 


AGGGTACAAG 


GAGGAGGTTG 


AGCAGGAGGT 


2340 


GGTG CTAAGG 


CGCAAAGCAA 


GGCGGTGTTG 


TATCCTCATT 


GACTGAAAC C 


GGGAAACCCA 


2400 


GCATGAACAA 


GAGGTCAGGG 


GACTGCAAGG 


AGCGGAGGCT 


ACATGTATGA 


CTACCCCCGA 


2460 


CGCGGGCGAT 


GATTCCTTGA 


CTATTGGGAC 


CTATTTCGTT 


GGGCTCGGGC 


ACATGACCCC 


2520 


CCTGGCCCCT 


TCGCTGTATG 


GTGCCCAGCC 


GCCCAGCCGC 


CCCCCGCCCA 


CAC 


2573 



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1704 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabldopsis thaliana. 

(B) STRAIN: ecotype Columbia 

(ix) FEATURE: 

(A) NAME /KEY : CDS ._ 

(B) LOCATION: 16.. 1629 

(D) OTHER INFORMATION: /product = "protoporphyrinogen 

oxidase" 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

TTCTCTGCGA TTTCC ATG GAG TTA TCT CTT CTC CGT CCG ACG ACT CAA TCG 
Met Glu Leu Ser Leu Leu Arg Pro Thr Thr Gin Ser 
1 5 10 

CTT CTT CCG TCG TTT TCG AAG CCC AAT CTC CGA TTA AAT GTT TAT AAG 
Leu Leu Pro Ser Phe Ser Lys Pro Asn Leu Arg Leu Asn Val Tyr Lys 
15 20 25 

CCT CTT AGA CTC CGT TGT TCA GTG GCC GGT GGA CCA ACC GTC GGA TCT 
Pro Leu Arg Leu Arg Cys Ser Val Ala Gly Gly Pro Thr Val Gly Ser 
30 35 40 

TCA AAA ATC GAA GGC GGA GGA GGC ACC ACC ATC ACG ACG GAT TGT GTG 
Ser Lys lie Glu Gly Gly Gly Gly Thr Thr lie Thr Thr Asp Cys Val 
45 50 55 60 

ATT GTC GGC GGA GGT ATT AGT..-GGT. CTT TGC ATC GCT CAG GCG CTT GCT 
lie Val Gly Gly Gly He Ser Gly Leu Cys He Ala Gin Ala Leu Ala 
65 70 75 

ACT AAG CAT CCT GAT GCT GCT CCG AAT TTA ATT GTG ACC GAG GCT AAG 
Thr Lys His Pro Asp Ala Ala Pro Asn Leu He Val Thr Glu Ala Lys 
80 85 90 



GAT CGT GTT GGA GGC AAC ATT ATC ACT CGT GAA GAG AAT GGT TTT CTC 

Asp Arg Val Gly Gly Asn He He Thr Arg Glu Glu Asn Gly Phe Leu 
95 100 105 

TGG GAA GAA GGT CCC AAT AGT TTT CAA CCG TCT GAT CCT ATG CTC ACT 

Trp Glu Glu Gly Pro Asn Ser Phe Gin Pro Ser Asp Pro Met Leu Thr 
110 115 120 

ATG GTG GTA GAT AGT GGT TTG AAG GAT GAT TTG GTG TTG GGA GAT CCT 

Met Val Val Asp Ser Gly Leu Lys Asp Asp Leu Val Leu Gly Asp Pro 

125 130 135 140 

ACT GCG CCA AGG TTT GTG TTG TGG AAT GGG AAA TTG AGG CCG GTT CCA 

Thr Ala Pro Arg Phe Val Leu Trp Asn Gly Lys Leu Arg Pro Val Pro 

145 150 155 

TCG AAG CTA ACA GAC TTA CCG TTC TTT GAT TTG ATG AGT ATT GGT GGG 

Ser Lys Leu Thr Asp Leu Pro Phe Phe Asp Leu Met Ser He Gly Gly 

160 165 170 

AAG ATT AGA GCT GGT TTT GGT GCA CTT GGC ATT CGA CCG TCA CCT CCA 

Lys He Arg Ala Gly Phe Gly Ala Leu Gly He Arg Pro Ser Pro Pro 
175 180 185 

GGT CGT GAA GAA TCT GTG GAG GAG TTT GTA CGG CGT AAC CTC GGT GAT 

Gly Arg Glu Glu Ser Val Glu Glu Phe Val Arg Arg Asn Leu Gly Asp 
190 195 200 

GAG GTT TTT GAG CGC CTG ATT GAA CCG TTT TGT TCA GGT GTT TAT GCT 
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Glu Val Phe Glu Arg Leu lie Glu Pro Phe Cys Ser Gly Val Tyr Ala 

205 210 215 220 

GGT GAT CCT TCA AAA CTG AGC ATG AAA GCA GCG TTT GGG AAG GTT TGG 

Gly Asp Pro Ser Lys Leu Ser Met Lys Ala Ala Phe Gly Lys Val Trp 

225 230 235 

AAA CTA GAG CAA AAT GGT GGA AGC ATA ATA GGT GGT ACT TTT AAG GCA 

Lys Leu Glu Gin Asn Gly Gly Ser lie lie Gly Gly Thr Phe Lys Ala 

240 245 250 

ATT CAG GAG AGG AAA AAC GCT CCC AAG GCA GAA CGA GAC CCG CGC CTG 

lie Gin Glu Arg Lys Asn Ala Pro Lys Ala Glu Arg Asp Pro Arg Leu 

255 260 265 

CCA AAA CCA CAG GGC CAA ACA GTT GGT TCT TTC AGG AAG GGA CTT CGA 

Pro Lys Pro Gin Gly Gin Thr Val Gly Ser Phe Arg Lys Gly Leu Arg 

270 275 280 

ATG TTG CCA GAA GCA ATA TCT GCA AGA TTA GGT AGC AAA GTT AAG TTG 

Met Leu Pro Glu Ala lie Ser Ala Arg Leu Gly Ser Lys Val Lys Leu 

285 290 295 300 

TCT TGG AAG CTC TCA GGT ATC ACT AAG CTG GAG AGC GGA GGA TAC AAC 

Ser Trp Lys Leu Ser Gly lie Thr Lys Leu Glu Ser Gly Gly Tyr Asn 

305 310 315 

TTA ACA TAT GAG ACT CCA GAT GGT TTA GTT TCC GTG CAG AGC AAA AGT 

Leu Thr Tyr Glu Thr Pro Asp Gly Leu Val Ser Val Gin Ser Lys Ser 

320 325 330 

GTT GTA ATG ACG GTG CCA TCT CAT GTT GCA AGT GGT CTC TTG CGC CCT 

Val Val Met Thr Val Pro Ser His Val Ala Ser Gly Leu Leu Arg Pro 

335 340 345 

CTT TCT GAA TCT GCT GCA AAT GCA CTC TCA AAA CTA TAT TAC CCA CCA 

Leu Ser Glu Ser Ala Ala Asn Ala Leu Ser Lys Leu Tyr Tyr Pro Pro 

350 355 360 

GTT GCA GCA GTA TCT ATC TCG TAC CCG AAA GAA GCA ATC CGA ACA GAA 

Val Ala Ala Val Ser lie Ser Tyr Pro Lys Glu Ala lie Arg Thr Glu 

365 370 375 380 

TGT TTG ATA GAT GGT GAA CTA AAG GGT TTT GGG CAA TTG CAT CCA CGC 

Cys Leu lie Asp Gly Glu Leu Lys Gly Phe Gly Gin Leu His Pro Arg 

385 390 395 

ACG CAA GGA GTT GAA ACA TTA GGA ACT ATC TAC AGC TCC TCA CTC TTT 

Thr Gin Gly Val Glu Thr Leu Gly Thr lie Tyr Ser Ser Ser Leu Phe 

400 405 410 

CCA AAT CGC GCA CCG CCC GGA AGA ATT TTG CTG TTG AAC TAC ATT GGC 

Pro Asn Arg Ala Pro Pro Gly Arg lie Leu Leu Leu Asn Tyr lie Gly 

415 420 425 

GGG TCT ACA AAC ACC GGA ATT CTG TCC AAG TCT GAA GGT GAG TTA GTG 
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Gly Ser Thr Asn Thr Gly lie Leu Ser Lys Ser Glu Gly Glu Leu Val 
430 435 440 

GAA GCA GTT GAC AGA GAT TTG AGG AAA ATG CTA ATT AAG CCT AAT TCG 13 9 5 

Glu Ala Val Asp Arg Asp Leu Arg Lys Met Leu lie Lys Pro Asn Ser 
445 450 455 460 

ACC GAT CCA CTT AAA TTA GGA GTT AGG GTA TGG CCT CAA GCC ATT CCT 1443 
Thr Asp Pro Leu Lys Leu Gly Val Arg Val Trp Pro Gin Ala lie Pro 
465 470 475 

CAG TTT CTA GTT GGT CAC TTT GAT ATC CTT GAC ACG GCT AAA TCA TCT 1491 
Gin Phe Leu Val Gly His Phe Asp lie Leu Asp Thr Ala Lys Ser Ser 
480 485 490 

CTA ACG TCT TCG GGC TAC GAA GGG CTA TTT TTG GGT GGC AAT TAC GTC 153 9 
Leu Thr Ser Ser Gly Tyr Glu Gly Leu Phe Leu Gly Gly Asn Tyr Val 
495 500 505 

GCT GGT GTA GCC TTA GGC CGG TGT GTA GAA GGC GCA TAT GAA ACC GCG 1587 
Ala Gly Val Ala Leu Gly Arg Cys Val Glu Gly Ala Tyr Glu Thr Ala 
510 515 520 

ATT GAG GTC AAC AAC TTC ATG TCA CGG TAC GCT TAC AAG TAA 1629 
lie Glu Val Asn Asn Phe Met Ser Arg Tyr Ala Tyr Lys * 
525 530 535 

ATGTAAAACA TTAAATCTCC CAGCTTGCGT GAGTTTTATT AAATATTTTG AGATATC CAA 1689 
AAAAAAAAAA AAAAA 1704 
(2) INFORMATION FOR SEQ ID NO : 12 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 537 amino acids 

( B ) TYPE : amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULAR TYPE: protein - - 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis th.alia.na 

(B) STRAIN: ecotype Columbia 

(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1. .537 

(C) OTHER INFORMATION: /product="protoporphyrinogen oxidase" 
(xi) SEQUENCE DESCRIPTION: SEQ. ID. NO: 12: 



Met Glu Leu Ser Leu Leu Arg Pro Thr Thr Gin Ser Leu Leu Pro Ser 
15 10 15 
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Phe Ser Lys Pro Asn Leu Arg Leu Asn Val Tyr Lys Pro Leu Arg Leu 
20 25 30 

Arg Cys Ser Val Ala Gly Gly Pro Thr Val Gly Ser Ser Lys He Glu 
35 40 45 

Gly Gly Gly Gly Thr Thr He Thr Thr Asp Cys Val He Val Gly Glv 
50 55 60 

Gly He Ser Gly Leu Cys He Ala Gin Ala Leu Ala Thr Lys His Pro 
65 70 75 80 

Asp Ala Ala Pro Asn Leu He Val Thr Glu Ala Lys Asp Arg Val Gly 
85 90 95 

Gly Asn lie lie Thr Arg Glu Glu Asn Gly Phe Leu Trp Glu Glu Gly 
100 105 110 

Pro Asn Ser Phe Gin Pro Ser Asp Pro Met Leu Thr Met Val Val Asp 
115 120 125 

Ser Gly Leu Lys Asp Asp Leu Val Leu Gly Asp Pro Thr Ala Pro Arg 
130 135 140 

Phe Val Leu Trp Asn Gly Lys Leu Arg Pro Val Pro Ser Lys Leu Thr 
14-S-- 150 155 160 

Asp Leu Pro Phe Phe Asp Leu Met Ser He Gly Gly Lys He Arg Ala 
165 170 175 

Gly Phe Gly Ala Leu Gly He Arg Pro Ser Pro Pro Gly Arg Glu Glu 
180 185 190 

Ser Val Glu Glu Phe Val Arg Arg Asn Leu Gly Asp Glu Val Phe Glu 
195 200 205 

Arg Leu He Glu Pro Phe Cys Ser Gly Val Tyr Ala Gly Asp Pro Ser 
210 215 220 

Lys Leu Ser Met Lys Ala Ala Phe Gly Lys Val Trp Lys Leu Glu Gin 
225 230 235 240 

Asn Gly Gly Ser He He Gly Gly Thr Phe Lys Ala He Gin Glu Arg 
245 250 255 

Lys Asn Ala Pro Lys Ala Glu Arg Asp Pro Arg Leu Pro Lys Pro Gin 
260 265 270 

Gly Gin Thr Val Gly Ser Phe Arg Lys Gly Leu Arg Met Leu Pro Glu 
27b 280 285 

Ala He Ser Ala Arg Leu Gly Ser Lys Val Lys Leu Ser Trp Lys Leu 
290 295 300 

Ser Gly He Thr Lys Leu Glu Ser Gly Gly Tyr Asn Leu Thr Tyr Glu 
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Thr Pro Asp Gly Leu Val Ser Val Gin Ser Lys Ser Val Val Met Thr 
325 330 335 

Val Pro Ser His Val Ala Ser Gly Leu Leu Arg Pro Leu Ser Glu Ser 
340 345 350 

Ala Ala Asn Ala Leu Ser Lys Leu Tyr Tyr Pro Pro Val Ala Ala Val 
355 360 365 

Ser lie Ser Tyr Pro Lys Glu Ala He Arg Thr Glu Cys Leu He Asp 
370 375 380 

Gly Glu Leu Lys Gly Phe Gly Gin Leu His Pro Arg Thr Gin Gly Val 
385 390 395 400 

Glu Thr Leu Gly Thr He Tyr Ser Ser Ser Leu Phe Pro Asn Arg Ala 
405 410 415 

Pro Pro Gly Arg He Leu Leu Leu Asn Tyr He Gly Gly Ser Thr Asn 
420 425 430 

Thr Gly He Leu Ser Lys Ser Glu Gly Glu Leu Val Glu Ala Val Asp 
435 440 445 

Arg Asp Leu Arg Lys Met Leu He Lys Pro Asn Ser Thr Asp Pro Leu 
450 455 460 

Lys Leu Gly Val Arg Val Trp Pro Gin Ala He Pro Gin Phe Leu Val 
465 470 475 480 

Gly His Phe Asp He Leu Asp Thr Ala Lys Ser Ser Leu Thr Ser Ser 
485 490 495 

Gly Tyr Glu Gly Leu Phe Leu Gly Gly Asn Tyr Val Ala Gly Val Ala 
500 505 510 

Leu Gly Arg Cys Val Glu Gly Ala Tyr Glu Thr Ala He Glu Val Asn 
515 520 525 

Asn Phe Met Ser Arg Tyr Ala Tyr Lys * 
530 535 

(2) INFORMATION FOR SEQ ID NO: 13 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1698 base pairs 

(B ) TYPE : nucleic acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: not relevant 

<ii) MOLECULAR TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Zea mays 

(B) STRAIN: B73 inbred 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 2.-1453 

(C) OTHER INFORMATION: /product= "protoporphyrinogen oxidase" 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:13: 

G AAT TCG GCG GAC TGC GTC GTG GTG GGC GGA GGC ATC AGT GGC CTC 
Asn Ser Ala Asp Cys Val Val Val Gly Gly Gly lie Ser Gly Leu 
15 10 15 

TGC ACC GCG CAG GCG CTG GCC ACG CGG CAC GGC GTC GGG GAC GTG CTT 
Cys Thr Ala Gin Ala Leu Ala Thr Arg His Gly Val Gly Asp Val Leu 
20 25 • 30 

GTC ACG GAG GCC CGC GCC CGC CCC GGC GGC AAC ATT ACC ACC GTC GAG 
Val Thr Glu Ala Arg Ala Arg Pro Gly Gly Asn lie Thr Thr Val Glu 
35 40 45 

CGC CCC GAG GAA GGG TAC CTC TGG GAG GAG GGT CCC AAC AGC TTC CAG 
Arg Pro Glu Glu Gly Tyr Leu Trp Glu Glu Gly Pro Asn Ser Phe Gin 
50 55 60 

— eeC TCC GAC CCC GTT CTC ACC ATG GCC GTG GAC AGC GGA CTG AAG GAT 
Pro Ser Asp Pro Val Leu Thr Met Ala Val Asp Ser Gly Leu Lys Asp 
65 70 75 

GAC TTG GTT TTT GGG GAC CCA AAC GCG CCG CGT TTC GTG CTG TGG GAG 
Asp Leu Val Phe Gly Asp Pro Asn Ala Pro Arg Phe Val Leu Trp Glu 
80 85 90 95 

GGG AAG CTG AGG CCC GTG CCA TCC AAG CCC GCC GAC CTC CCG TTC TTC 
Gly Lys Leu Arg Pro Val Pro Ser Lys Pro Ala Asp Leu Pro Phe Phe 
100 105 110 

GAT CTC ATG AGC ATC CCA GGG AAG CTC AGG GCC GGT CTA GGC GCG CTT 
Asp Leu Met Ser lie Pro Gly Lys Leu Arg Ala Gly Leu Gly Ala Leu 
115 120 125 

GGC ATC CGC CCG CCT CCT CCA GGC CGC GAA GAG TCA GTG GAG GAG TTC 
Gly lie Arg Pro Pro Pro Pro Gly Arg Glu Glu Ser Val Glu Glu Phe 
130 135 140 

GTG CGC CGC AAC CTC GGT GCT GAG GTC TTT GAG CGC CTC ATT GAG CCT 
Val Arg Arg Asn Leu Gly Ala Glu Val Phe Glu Arg Leu lie Glu Pro 
145 150 155 

TTC TGC TCA GGT GTC TAT GCT GGT GAT CCT TCT AAG CTC AGC ATG AAG 
Phe Cys Ser Gly Val Tyr Ala Gly Asp Pro Ser Lys Leu Ser Met Lys 
160 165 170 175 

GCT GCA TTT GGG AAG GTT TGG CGG TTG GAA GAA ACT GGA GGT AGT ATT 
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Ala Ala Phe Gly Lys Val Trp Arg Leu Glu Glu Thr Gly Gly Ser lie 
180 185 190 

ATT GGT GGA ACC ATC AAG ACA ATT CAG GAG AGG AGC AAG AAT CCA~ AAA 
lie Gly Gly Thr lie Lys Thr lie Gin Glu Arg Ser Lys Asn Pro Lys 
195 200 205 

CCA CCG AGG GAT GCC CGC CTT CCG AAG CCA AAA GGG CAG ACA GTT GCA 
Pro Pro Arg Asp Ala Arg Leu Pro Lys Pro Lys Gly Gin Thr Val Ala 
210 215 220 

TCT TTC AGG AAG GGT CTT GCC ATG CTT CCA AAT GCC ATT ACA TCC AGC 
Ser Phe Arg Lys Gly Leu Ala Met Leu Pro Asn Ala lie Thr Ser Ser 
225 230 235 

TTG GGT AGT AAA GTC AAA CTA TCA TGG AAA CTC ACG AGC ATT ACA AAA 
Leu Gly Ser Lys Val Lys Leu Ser Trp Lys Leu Thr Ser lie Thr Lys 
240 245 -250 255 

TCA GAT GAC AAG GGA TAT GTT TTG GAG TAT GAA ACG CCA GAA GGG GTT 
Ser Asp Asp Lys Gly Tyr Val Leu Glu Tyr Glu Thr Pro Glu Gly Val 
260 265 270 

GTT TCG GTG CAG GCT AAA AGT GTT ATC ATG ACT ATT CCA TCA TAT GTT 
Val Ser Val Gin Ala Lys Ser Val lie Met Thr lie Pro Ser Tyr Val 
275 280 285 

GCT AGC AAC ATT TTG CGT CCA CTT TCA AGC GAT GCT GCA GAT GCT CTA 
Ala Ser Asn lie Leu Arg Pro Leu Ser Ser Asp Ala Ala Asp Ala Leu 
290 295 300 

TCA AGA TTC TAT TAT CCA CCG GTT GCT GCT GTA ACT GTT TCG TAT CCA 
Ser Arg Phe Tyr Tyr Pro Pro Val Ala Ala Val Thr Val Ser Tyr Pro 
305 310 315 

AAG GAA GCA ATT AGA AAA GAA TGC TTA ATT GAT GGG GAA CTC CAG GGC 
Lys Glu Ala lie Arg Lys Glu Cys Leu lie Asp Gly Glu Leu Gin Gly 
320 325 330 335 

TTT GGC CAG TTG CAT CCA CGT AGT CAA GGA GTT GAG ACA TTA GGA ACA 
Phe Gly Gin Leu His Pro Arg Ser Gin Gly Val Glu Thr Leu Gly Thr 
340 345 350 

ATA TAC AGT TCC TCA CTC TTT CCA AAT CGT GCT CCT GAC GGT AGG GTG 
lie Tyr Ser Ser Ser Leu Phe Pro Asn Arg Ala Pro Asp Gly Arg Val 
355 360 365 

TTA CTT CTA AAC TAC ATA GGA GGT GCT ACA AAC ACA GGA ATT GTT TCC 
Leu Leu Leu Asn Tyr lie Gly Gly Ala Thr Asn Thr Gly lie Val Ser 
370 375 380 

AAG ACT GAA AGT GAG CTG GTC GAA GCA GTT GAC CGT GAC CTC CGA AAA 
Lys Thr Glu Ser Glu Leu Val Glu Ala Val Asp Arg Asp Leu Arg Lys 
385 390 395 

ATG CTT ATA AAT TCT ACA GCA GTG GAC CCT TTA GTC CTT GGT GTT CGA 
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Met Leu He Asn Ser Thr Ala Val Asp Pro Leu Val Leu Gly Val Arg 
400 405 410 415 

GTT TGG CCA CAA GCC ATA CCT CAG TTC CTG GTA GGA CAT CTT GAT CTT 1294 
Val Trp Pro Gin Ala He Pro Gin Phe Leu Val Gly His Leu Asp Leu 
420 425 430 

CTG GAA GCC GCA AAA GCT GCC CTG GAC CGA GGT GGC TAC GAT GGG CTG 1342 
Leu Glu Ala Ala Lys Ala Ala Leu Asp Arg Gly Gly Tyr Asp Gly Leu 
435 440 445 

TTC CTA GGA GGG AAC TAT GTT GCA GGA GTT GCC CTG GGC AGA TGC GTT 13 90 

Phe Leu Gly Gly Asn Tyr Val Ala Gly Val Ala Leu Gly Arg Cys Val 
450 455 460 

GAG GGC GCG TAT GAA AGT GCC TCG CAA ATA TCT GAC TTC TTG ACC AAG 143 8 

Glu Gly Ala Tyr Glu Ser Ala Ser Gin lie Ser Asp Phe Leu Thr Lys 
465 470 475 

TAT GCC TAC AAG TGA TGAAAGAAGT GGAGCGCTAC TTGCCAATCG TTTATGTTGC 14 93 

Tyr Ala Tyr Lys * 

480 

ATAGATGAGG TGCCTCCGGG GAAAAAAAAG CTTGAATAGT ATTTTTTATT CTTATTTTGT 1553 
AAATTG CATT TCTGTTCTTT TTTCTATCAG TAATTAGTTA TATTTTAGTT CTGTAGGAGA 1613 
TTGTTCTGTT CACTGCCCTT CAAAAGAAAT TTTATTTTTC ATTCTTTTAT GAGAGCTGTG 1673 
CTACTTAAAA AAAAAAAAAA AAAAA 1698 
(2) INFORMATION FOR SEQ ID NO: 14 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 483 amino acids 

( B ) TYPE : amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULAR TYPE: protein . . 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Zea mays 

(B) STRAIN: B73 inbred 

(ix) FEATURE: 

(A) NAME/KEY: peptide 
<B) LOCATION: 1..483 

(C) OTHER iJMb'UKMATION: /note= " protoporphyr inogen oxidase" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:14: 

Asn Ser Ala Asp Cys Val Val Val Gly Gly Gly He Ser Gly Leu Cys 
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Thr Ala Gin Ala Leu Ala Thr Arg His Gly Val Gly Asp Val Leu Val 
20 25 30 

Thr Glu Ala Arg Ala Arg Pro Gly Gly Asn He Thr Thr Val Glu Arg 
35 40 45 

Pro Glu Glu Gly Tyr Leu Trp Glu Glu Gly Pro Asn Ser Phe Gin Pro 
50 55 60 

Ser Asp Pro Val Leu Thr Met Ala Val Asp Ser Gly Leu Lys Asp Asp 
65 70 75 80 

Leu Val Phe Gly Asp Pro Asn Ala Pro Arg Phe Val Leu Trp Glu Gly 
85 90 95 

Lys Leu Arg Pro Val Pro Ser Lys Pro Ala Asp Leu Pro Phe Phe Asp 
100 105 110 

Leu Met Ser He Pro Gly Lys Leu Arg Ala Gly Leu Gly Ala Leu Gly 
115 120 125 

He Arg Pro Pro Pro Pro Gly Arg Glu Glu Ser Val Glu Glu Phe Val 
130 135 140 

Arg Arg Asn Leu Gly Ala Glu Val Phe Glu Arg Leu He Glu Pro Phe 
145 150 155 160 

Cys Ser Gly Val Tyr Ala Gly Asp Pro Ser Lys Leu Ser Met Lys Ala 
165 170 175 

Ala Phe Gly Lys Val Trp Arg Leu Glu Glu Thr Gly Gly Ser He He 
180 185 190 

Gly Gly Thr He Lys Thr He Gin Glu Arg Ser Lys Asn Pro Lys Pro 
195 200 205 

Pro Arg Asp Ala Arg Leu Pro Lys Pro Lys Gly Gin Thr Val Ala Ser 
210 215 220 

Phe Arg Lys Gly Leu Ala Met Leu Pro Asn Ala He Thr Ser Ser Leu 
225 230 235 240 

Gly Ser Lys Val Lys Leu Ser Trp Lys Leu Thr Ser He Thr Lys Ser 
245 250 255 

Asp Asp Lys Gly Tyr Val Leu Glu Tyr Glu Thr Pro Glu Gly Val Val 
260 265 270 

Ser Val Gin Ala Lys Ser Val He Met Thr He Pro Ser Tyr Val Ala 
275 2RQ 285 

Ser Asn He Leu Arg Pro Leu Ser Ser Asp Ala Ala Asp Ala Leu Ser 
290 295 300 

Arg Phe Tyr Tyr Pro Pro Val Ala Ala Val Thr Val Ser Tyr Pro Lys 
305 310 315 320 
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Glu Ala lie Arg Lys Glu Cys Leu lie Asp Gly Glu Leu Gin Gly Phe 
325 330 335 

Gly Gin Leu His Pro Arg Ser Gin Gly Val Glu Thr Leu Gly Thr 'lie 
340 345 350 

Tyr Ser Ser Ser Leu Phe Pro Asn Arg Ala Pro Asp Gly Arg Val Leu 
355 360 365 

Leu Leu Asn Tyr lie Gly Gly Ala Thr Asn Thr Gly He Val Ser Lys 
370 375 380 

Thr Glu Ser Glu Leu Val Glu Ala Val Asp Arg Asp Leu Arg Lys Met 
385 390 395 400 

Leu He Asn Ser Thr Ala Val Asp Pro Leu Val Leu Gly Val Arg Val 
405 410 415 

Trp Pro Gin Ala He Pro Gin Phe Leu Val Gly His Leu Asp Leu Leu 
420 425 430 

Glu Ala Ala Lys Ala Ala Leu Asp Arg Gly Gly Tyr Asp Gly Leu Phe 
435 440 445 

Leu Gly Gly Asn Tyr Val Ala Gly Val Ala Leu Gly Arg Cys Val Glu 
450 455 460 

Gly Ala Tyr Glu Ser Ala Ser Gin He Ser Asp Phe Leu Thr Lys Tyr 
465 470 475 480 

Ala Tyr Lys 

(2) INFORMATION FOR SEQ ID NO:15 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 nucleotides 

( B ) TYPE : nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULAR TYPE: oligonucleotide 
(iii) HYPOTHETICAL: NO 
(iv) ANT I- SENSE: NO 

(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1..18 

(C) OTHER INFORMATION: /note= "oligonucleotide primer 1A fo 
Chlamydomonas relnhardzii" 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 15 

CCGTCTACCA GTTTCTTG 
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(2) INFORMATION FOR SEQ ID NO: 16 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 nucleotides 

( B ) TYPE : nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULAR TYPE: oligonucleotide 
(iii) HYPOTHETICAL: NO 
(iv) ANT I- SENSE: YES 

(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1..19 

(C) OTHER INFORMATION: /note= " oligonucleotide primer IB for 
Chlamydomonas reinhardtii" 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 16 

ATACAACCGC GGGATACGA 

(2) INFORMATION FOR SEQ ID NO: 17 

(!) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 nucleotides 

( B ) TYPE : nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULAR TYPE: oligonucleotide 
(iii) HYPOTHETICAL: NO 
(iv) ANT I- SENSE: NO 

(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1..17 

(C) OTHER INFORMATION: /note= 11 oligonucleotide primer 2A for 
Chlamydomonas reinhardtii" 

(Xi) SEQUENCE DESCRIPTION : SEQ ID NO: 17 

ACTTTGTCTG GTGCTCC 

(2) INFORMATION FOR SEQ ID NO:18 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 nucleotides 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 
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(ii) MOLECULAR TYPE: oligonucleotide 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: YES 

(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1..17 

(C) OTHER INFORMATION: /note= "oligonucleotide primer 2B for 
Chla.mydomon.as reinhardtii" 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 18 

TGGATCGCTT TGCTCAG 

( 2 } INFORMATION FOR SEQ ID NO: 19 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 338i base pairs 

( B ) TYPE : nucleic acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULAR TYPE: DNA ( genomic ) 
(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE : 

(A) ORGANISM: Chlamydomonas reinhardtii 

(B) STRAIN: RS-3 

(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1..3381 

(C) OTHER INFORMATION: /note=» " encodes protoporphyrinogen 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

CTCGAGAGCG TTGGAGGAAA TCCGTTTGGC ACCTGTTCCG GCTTCTTTGT GTGCACGGCC 6 0 

ACGTCCCCCT TTCCTGCTAC CCGCTCCCCC CCGGCTTTAC TGCCCCTTCC ACTCCTCGGC 12 0 

TCCATCCCGA TTCCATCCGC TCCTCCTCCC CCACCTAGAC TGTCTACCGT CTACCAGTTT 180 

CTTGGGCAAT CATTAACGTA ACCCCGCCTC CCTGCGCCTG CCCCTCCCTC CCTCTCCCCC 24 0 

CCGCACAGCC CGCCGCCGCC GAGGCCCTGG GCTCCTTCGA CTACCCGCCG ATGGGCGCCG 300 

TGACGCTGTC GTACCCGCTG AGCGCCGTGC GGGAGGAGCG CAAGGCCTCG GACGGGTCCG 3 60 

TGCCGGGCTT CGGTCAGCTG CACCCGCGCA CGCAGGTGGG CAAGTGCGCG CGTGTTGCGG 420 

GCGGTGTGTT GCGGAGGGGA GGGTGGTGGG GGTTGGGGGT GGGGGTGGGG GGGATTGGGG 4 80 
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CGCTGGGTCG 


TATCCCGCGG 


TTGTATCCTC 


GCGCTCCCCT 


CATCCATTCC 


CCCCTTCAAC 


540 - 




AACACACACG 


GGCGCACACG 


CACCCTCTTT 


GCGCTTACTT 


TGTCTGGTGC 


TCCTTAACAC 


600 




ACTCTTCGCT 


TCATTTTGGT 


GTCTT CTAAC 


ACACACACTT 


GTCCACACAC 


AGGGCATCAC 


660 




CACTCTGGGC 


ACCATCTACA 


GCTCCAGCCT 


GTTCCCCGGC 


CGCGCGCCCG 


AGGGCCACAT 


720 




GCTGCTGCTC 


AACTACATCG 


GCGGCACCAC 


CAACCGCGGC 


ATCGTCAACC 


AGACCACCGA 


780 




GCAG CTGGTG 


GAGCAGGTGT 


GTGTGTGGGG 


GGGTGGGGGG 


GGGGCAGTGG 


ATTTTTGGGC 


840 




TGAGCCCCCT 


GAGCAAAGCG 


ATCCAGGGGG 


GGCGAAGCCC 


CCCAGGATTG 


CCCCTGTCCG 


900 




TGCGTG CGTG 


TGTGCCTGTG 


TCGACAAAAA 


GTACCGTACT 


GGCACAAACC 


GCGAGTGCCA 


960 


o 


CGTATTATTA 


ATTGCAATTA 


CCTATTGTAG 


AAAAATAGAC 


GGCAGGGAAA 


ACTCGGCCGG 


1020 




AGCGAGAAGC 


GACCTCGTGA 


GTC CATGGAC 


ATCTTGACTT 


TCTTCAGTTC 


GCGAGTATAG 


1080 


y 


CTCTCGGCCC 


CTAAATATCT 


TACATCCATG 


TATCAAAACA 


TGTCGACGAC 


AAGCGTCTTG 


1140 


St 


GGGCAAGAAT 


GTCGAAATTG 


TTTGCAACAG 


CCAAACCATG 


CGTCCCCGAG 


CCTTACATGT 


1200 


ru 


GTCGCGGCCC 


GGGATCCCGC 


GCCCGAGCCC 


GGCTAGCCCT 


TTGCGGTGCT 


TGAGTGGGAT 


1260 


L 


GTGGGTGAGG 


TGCATTTGGG 


ATATCATGGA 


CCGTGAAGTG 


GCGTGGGTAA 


GGTGGCGTGG 


1320 


CO 


CGTGGCGGGG 


ACAGGGCATG 


TCGGTGCCTC 


GGCACAGCGT 


TGGCCTAGTG 


GCCAGTCCCG 


1380 


£ 


CTGGATGGGC 


TTGCAAGGGT 


GCTGTTCATG 


TCGCCGGTGC 


CCATCGTCAC 


ATCCCCTTGC 


1440 




GCTACATGGG 


GCTCAGCCCA 


TTTTCCAGCT 


GTACAAAGCT 


GACACCCCTT 


GTTGTGTGGC 


1500 




GTCTTGGACC 


CGTGTTGCTT 


CGGAGCTGGC 


CAGAACCCCC 


TGTGGGCACA 


CACACGCACA 


15.60 




CACACACACA 


CACACACACA 


CACACACACA 


CACACACACA 


CACACACACA 


CACACACACA 


1620 




CACACACACA 


CACACACACA 


CACACACACA 


CACACACACA 


CACATTTTCG 


TCCTGCAGCC 


1680 




CCGAACCCCG 


CCGCCCGTTC 


CACGTCTTCC 


ACCTGCCGCA 


CCCCCCCCCC 


TGCCGCACGC 


1740 




CTGCTCTCAC 


CGCCTCTCCC 


CCCACCCCAT 


CTCCCTGCAG 


GTGGACAAGG 


ACCTGCGCAA 






CATGGTCATC 


AAGCCCGACG 


CGCCCAAGCC 


CCGTGTGGTG 


GGCGTGCGCG 


TGTGGCCGCG 


1860 




CGCCATCCCG 


CAGGTGTGAG 


GGCGCAGCAG 


CCGGAGGGAT 


GGGCTAGATC 


CTAGTTTCTC 


1920 




AAAGAGCTCT 


ACAGCCCTAT 


AACCTCGACC 


TGCGACCTTC 


GACCTGATAA 


CCTGGCTGCC 


1980 




CCCTCCCAAC 


CTAGCCACCT 


CTCCCCGGAT 


TTGGGTTCAC 


TCGGTTGACT 


TG CTTTTGGG 


2040 




TTCTGGAATC 


AACTTCACCT 


GTTGTATACT 


TTGCTGCACT 


TCTCTGTACC 


ACTCTTTGCA 


2100 




TTAGGTTCGG 


TTTAGTTTGG 


GCTGCATGTG 


TAACCCCTCC 


TCCCCGCCCT 


GCCACCTGCA 


2160 



W ° 98/29554 PCT/US96/20415 



GTTCAAC CTG 


GGCCACCTGG 


AG CAGCTGGA 


CAAGGCGCGC 


AAGGCGCTGG 


ACGCGGCGGG 


2220 


GCTGCAGGGC 


GTGCACCTGG 


GGGGCAACTA 


CGTCAGCGGT 


GAGCGCGTGG 


GCAGCAGCAG 


2280 


CAGCAGGAAG 


AGGGGAGGGG 


AGGGGAGGGG 


AGGGTACAAG 


GAGGAGGTTG 


AG CAGG AGGT 


2340 


GGTGCTAAGG 


CGCAAAGCAA 


GGCGGTGTTG 


TATCCTCATT 


GACTGAAACC 


GGGAAACCCA 


2400 


GCATGAACAA 


GAGGTCAGGG 


GACTGCAAGG 


AGCGGAGGCT 


ACATGTATGA 


CTACCCCCGA 


2460 


CGCGGGCGAT 


GATTCCTTGA 


CTATTGGGAC 


CTATTTCGTT 


GGGCTCGGGC 


ACATGACCCC 


2520 


CCTGGCCCCT 


TCG CTGTATG 


GTGCCCAGCC 


GCCCAGCCGC 


CCCCCGCCCA 


CACGTGTGCC 


2580 


CACGCCTTTG 


CCTCATCCCC 


AACCCCCTCG 


GCCCCTCTCC 


CCCCTCGAAC 


CCCTGCAACC 


2640 


AGGTGTGGCC 


CTGGGCAAGG 


TGGTGGAGCA 


CGGCTACGAG 


TCCGCAGCCA 


ACCTGGCCAA 


2700 


GAGCGTGTCC 


AAGGCCGCAG 


TCAAGGCCTA 


AGCGGCTGCA 


GCAGTAGCAG 


CAGCAGCATC 


2760 


GGGCTGTAGC 


TGGTAAATGC 


CGCAGTGGCA 


CCGGCAGCAG 


CAATTGG CAA 


GCACTTGGGG 


2820 


CAAGCGGAGT 


GGAGG CGAGG 


GGGGGG CTAC 


CATTGG CGCT 


TGCTGGGATG 


TGTAGTAACA 


2880 


GTTGGAATGG 


ATCGGGGATG 


TGGAGCTAGG 


GGTTCGGGGG 


TCTGCCAAGG 


ACATAGGTGG 


2940 


TGCTGGGATG 


AGCGATGTGG 


TTGGTAAAGC 


TCTGTCGGCA 


CCGTTATGTG 


CGGGTTAACT 


3000 


GCACTATGAC 


GCTCCGTTGT 


ACAGCCCCGT 


TGTGCATTGT 


TTGCATGAAG 


TTTTGGCGAG 


3060 


AGTGAGTTGG 


CGCACACGCG 


GGGCGGTTTG 


GGGGCACTGT 


CCCTCAGTGT 


GGTCCCAGCA 


3120 


TAGCACAGGA 


GAGACACAGA 


ACTGAGTGAC 


ATAGACTAGG 


TCTCGAAGTA 


CCTTCAAAAG 


318 0 


GGGG CTATAA 


ATTGCGAATA 


CCCGGAGCAG 


GGGGCCAGAC 


CCAAGGCATT 


GACTGTCAGT 


324 0 


GCACAAGCGA 


AAGAC CAATT 


GCATGGGTTG 


CTTCCGTGGT 


GGGAAGAGGA 


GGGCAGGGGA 


3300 


GCATCGTCAG 


GTGTATGTTG 


CGGCTTCGCC 


CATAAGTGCC 


ATGGTTTCGA 


AGATGCTTAA 


3360 


GACTAACAAT 


GCCAACTCGA 


G 








3381 


(2) INFORMATION FOR SEQ ID NO: 20 











(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 nucleotides 
{ B ) TYPE : nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

iii) MOLECULAR TYPE: oligonucleotide 
(iii) HYPOTHETICAL: NO 
(iv) ■ ANTI-SENSE: NO 
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(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1..18 

(C) OTHER INFORMATION: /note= "oligonucleotide primer 3A for 
Chlamydomonas reinhardtii " 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20 

TTCCACGTCT TCCACCTG 

(2) INFORMATION FOR SEQ ID NO: 21 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 nucleotides 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULAR TYPE: oligonucleotide 
(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: YES 

(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1 . . 17 

(C) OTHER INFORMATION; /note= " oligonucleotide primer 3B for 
Chlamydomonas reinhardtii" 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 21 

CTAGGATCTA GCCCATC 

(2) INFORMATION FOR SEQ ID NO: 22 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 nucleotides 

( B ) TYPE : nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULAR TYPE: oligonucleotide 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 

(ix) FEATURE: 

(A) NAME /KEY: - 

(B) LOCATION : 1 . . 18 

(C) OTHER INFORMATION: /note= " ol igonucleot ide primer 4A for 
Chlamydomonas reinhardtii" 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 22 
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CTGC&TGTGT AACCCCTC 

(2) INFORMATION FOR SEQ ID NO: 23 

(i) SEQUENCE CHARACTERISTICS: 

y(A) LENGTH: 18 nucleotides 
B ) TYPE : nucleic acid 
S) STRANDEDNESS : single 
(D^TOPOLOGY: linear 

(ii) MOLECULAR. TYPE : oligonucleotide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: 

(ix) FEATURE: 

(A) NAME/KEY\ - 

(B) LOCATION :VL . . 18 

(C) OTHER INFORMATION: /note= "oligonucleotide primer 4B for 
Chlamydomonas reinhardtx)L" 

(xi) SEQUENCE DESCRIPTION\SEQ ID NO: 23 

GACCTCTTGT TCATGCTG 

( 2T1NFORMATION FOR SEQ ID NO^ 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 17 nucleotid 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULAR TYPE: oligonucleotide 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: YES 

(ix) FEATURE: 

(A) NAME/KEY: 

(B) LOCATION: 

(C) OTHER INFORMATION: /note= " oligonucleotide primer 5B for 
Chlamydomonas reinhardtii" 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO : 24 



CGGCATTTAC CAGCTAC 



